Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 3314 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 294.6 KiB |
| Average record size in memory | 91.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 3 |
| Boolean | 3 |
LargestPropertyUseTypeGFA is highly overall correlated with TotalGHGEmissions and 4 other fields | High correlation |
TotalGHGEmissions is highly overall correlated with LargestPropertyUseTypeGFA and 4 other fields | High correlation |
SiteEnergyUse_kBtu_ is highly overall correlated with LargestPropertyUseTypeGFA and 4 other fields | High correlation |
LargestPropertyUseTypeGFA_log is highly overall correlated with LargestPropertyUseTypeGFA and 4 other fields | High correlation |
TotalGHGEmissions_log is highly overall correlated with LargestPropertyUseTypeGFA and 5 other fields | High correlation |
SiteEnergyUse_kBtu_log is highly overall correlated with LargestPropertyUseTypeGFA and 4 other fields | High correlation |
BuildingType is highly overall correlated with PrimaryPropertyType | High correlation |
Have_NaturalGas_Energy is highly overall correlated with TotalGHGEmissions_log | High correlation |
PrimaryPropertyType is highly overall correlated with BuildingType | High correlation |
Have_Stream_Energy is highly imbalanced (76.3%) | Imbalance |
Have_Electricity_Energy is highly imbalanced (99.6%) | Imbalance |
NumberofBuildings is highly skewed (γ1 = 43.11529862) | Skewed |
LargestPropertyUseTypeGFA is highly skewed (γ1 = 30.22421764) | Skewed |
SiteEnergyUse_kBtu_ is highly skewed (γ1 = 24.69850505) | Skewed |
SiteEnergyUse_kBtu_ has unique values | Unique |
SiteEnergyUse_kBtu_log has unique values | Unique |
NumberofBuildings has 92 (2.8%) zeros | Zeros |
Reproduction
| Analysis started | 2023-06-20 13:11:24.563326 |
|---|---|
| Analysis finished | 2023-06-20 13:11:58.211299 |
| Duration | 33.65 seconds |
| Software version | ydata-profiling vv4.2.0 |
| Download configuration | config.json |
YearBuilt
Real number (ℝ)
| Distinct | 113 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1968.6976 |
| Minimum | 1900 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1908 |
| Q1 | 1948 |
| median | 1975 |
| Q3 | 1997 |
| 95-th percentile | 2012 |
| Maximum | 2015 |
| Range | 115 |
| Interquartile range (IQR) | 49 |
Descriptive statistics
| Standard deviation | 33.059519 |
|---|---|
| Coefficient of variation (CV) | 0.016792583 |
| Kurtosis | -0.86857772 |
| Mean | 1968.6976 |
| Median Absolute Deviation (MAD) | 24 |
| Skewness | -0.54205395 |
| Sum | 6524264 |
| Variance | 1092.9318 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2000 | 71 | 2.1% |
| 2014 | 67 | 2.0% |
| 2008 | 64 | 1.9% |
| 1968 | 63 | 1.9% |
| 1989 | 63 | 1.9% |
| 1999 | 63 | 1.9% |
| 1988 | 61 | 1.8% |
| 2001 | 59 | 1.8% |
| 2002 | 58 | 1.8% |
| 1990 | 58 | 1.8% |
| Other values (103) | 2687 |
| Value | Count | Frequency (%) |
| 1900 | 53 | |
| 1901 | 8 | 0.2% |
| 1902 | 11 | 0.3% |
| 1903 | 3 | 0.1% |
| 1904 | 14 | 0.4% |
| 1905 | 9 | 0.3% |
| 1906 | 18 | 0.5% |
| 1907 | 31 | |
| 1908 | 27 | |
| 1909 | 31 |
| Value | Count | Frequency (%) |
| 2015 | 35 | |
| 2014 | 67 | |
| 2013 | 50 | |
| 2012 | 35 | |
| 2011 | 15 | 0.5% |
| 2010 | 24 | 0.7% |
| 2009 | 41 | |
| 2008 | 64 | |
| 2007 | 42 | |
| 2006 | 45 |
BuildingType
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| NonResidential | |
|---|---|
| Multifamily LR (1-4) | |
| Multifamily MR (5-9) | |
| Multifamily HR (10+) | 109 |
| Nonresidential COS | 84 |
| Other values (3) | 108 |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 17.166566 |
| Min length | 6 |
Characters and Unicode
| Total characters | 56890 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NonResidential |
|---|---|
| 2nd row | NonResidential |
| 3rd row | NonResidential |
| 4th row | NonResidential |
| 5th row | NonResidential |
Common Values
| Value | Count | Frequency (%) |
| NonResidential | 1439 | |
| Multifamily LR (1-4) | 996 | |
| Multifamily MR (5-9) | 578 | |
| Multifamily HR (10+) | 109 | 3.3% |
| Nonresidential COS | 84 | 2.5% |
| SPS-District K-12 | 83 | 2.5% |
| Campus | 24 | 0.7% |
| Nonresidential WA | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| multifamily | 1683 | |
| nonresidential | 1524 | |
| lr | 996 | |
| 1-4 | 996 | |
| mr | 578 | 8.4% |
| 5-9 | 578 | 8.4% |
| hr | 109 | 1.6% |
| 10 | 109 | 1.6% |
| cos | 84 | 1.2% |
| sps-district | 83 | 1.2% |
| Other values (3) | 108 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 6580 | 11.6% |
| l | 4890 | 8.6% |
| 3534 | 6.2% | |
| t | 3373 | 5.9% |
| a | 3231 | 5.7% |
| R | 3122 | 5.5% |
| n | 3048 | 5.4% |
| e | 3048 | 5.4% |
| M | 2261 | 4.0% |
| - | 1740 | 3.1% |
| Other values (30) | 22063 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35904 | |
| Uppercase Letter | 8705 | 15.3% |
| Space Separator | 3534 | 6.2% |
| Decimal Number | 3532 | 6.2% |
| Dash Punctuation | 1740 | 3.1% |
| Open Punctuation | 1683 | 3.0% |
| Close Punctuation | 1683 | 3.0% |
| Math Symbol | 109 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 6580 | |
| l | 4890 | |
| t | 3373 | |
| a | 3231 | |
| n | 3048 | |
| e | 3048 | |
| u | 1707 | 4.8% |
| m | 1707 | 4.8% |
| f | 1683 | 4.7% |
| y | 1683 | 4.7% |
| Other values (6) | 4954 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 3122 | |
| M | 2261 | |
| N | 1524 | |
| L | 996 | 11.4% |
| S | 250 | 2.9% |
| H | 109 | 1.3% |
| C | 108 | 1.2% |
| O | 84 | 1.0% |
| P | 83 | 1.0% |
| D | 83 | 1.0% |
| Other values (3) | 85 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1188 | |
| 4 | 996 | |
| 5 | 578 | |
| 9 | 578 | |
| 0 | 109 | 3.1% |
| 2 | 83 | 2.3% |
Space Separator
| Value | Count | Frequency (%) |
| 3534 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1740 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1683 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1683 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 109 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44609 | |
| Common | 12281 | 21.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 6580 | |
| l | 4890 | |
| t | 3373 | 7.6% |
| a | 3231 | 7.2% |
| R | 3122 | 7.0% |
| n | 3048 | 6.8% |
| e | 3048 | 6.8% |
| M | 2261 | 5.1% |
| u | 1707 | 3.8% |
| m | 1707 | 3.8% |
| Other values (19) | 11642 |
Common
| Value | Count | Frequency (%) |
| 3534 | ||
| - | 1740 | |
| ( | 1683 | |
| ) | 1683 | |
| 1 | 1188 | 9.7% |
| 4 | 996 | 8.1% |
| 5 | 578 | 4.7% |
| 9 | 578 | 4.7% |
| 0 | 109 | 0.9% |
| + | 109 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56890 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 6580 | 11.6% |
| l | 4890 | 8.6% |
| 3534 | 6.2% | |
| t | 3373 | 5.9% |
| a | 3231 | 5.7% |
| R | 3122 | 5.5% |
| n | 3048 | 5.4% |
| e | 3048 | 5.4% |
| M | 2261 | 4.0% |
| - | 1740 | 3.1% |
| Other values (30) | 22063 |
Neighborhood
Categorical
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| DOWNTOWN | |
|---|---|
| EAST | |
| MAGNOLIA / QUEEN ANNE | |
| GREATER DUWAMISH | |
| NORTHEAST | |
| Other values (14) |
Length
| Max length | 22 |
|---|---|
| Median length | 16 |
| Mean length | 10.118286 |
| Min length | 4 |
Characters and Unicode
| Total characters | 33532 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | DOWNTOWN |
|---|---|
| 2nd row | DOWNTOWN |
| 3rd row | DOWNTOWN |
| 4th row | DOWNTOWN |
| 5th row | DOWNTOWN |
Common Values
| Value | Count | Frequency (%) |
| DOWNTOWN | 562 | |
| EAST | 448 | |
| MAGNOLIA / QUEEN ANNE | 415 | |
| GREATER DUWAMISH | 371 | |
| NORTHEAST | 274 | |
| LAKE UNION | 249 | |
| NORTHWEST | 208 | 6.3% |
| SOUTHWEST | 157 | 4.7% |
| NORTH | 142 | 4.3% |
| BALLARD | 124 | 3.7% |
| Other values (9) | 364 |
Length
| Value | Count | Frequency (%) |
| downtown | 562 | |
| east | 448 | 8.6% |
| magnolia | 415 | 8.0% |
| 415 | 8.0% | |
| queen | 415 | 8.0% |
| anne | 415 | 8.0% |
| greater | 371 | 7.2% |
| duwamish | 371 | 7.2% |
| northeast | 274 | 5.3% |
| union | 249 | 4.8% |
| Other values (9) | 1245 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 4060 | |
| E | 3679 | |
| A | 3401 | |
| T | 3090 | 9.2% |
| O | 2666 | 8.0% |
| 1866 | 5.6% | |
| W | 1860 | 5.5% |
| S | 1804 | 5.4% |
| R | 1672 | 5.0% |
| U | 1286 | 3.8% |
| Other values (24) | 8148 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 30773 | |
| Space Separator | 1866 | 5.6% |
| Lowercase Letter | 478 | 1.4% |
| Other Punctuation | 415 | 1.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 4060 | |
| E | 3679 | |
| A | 3401 | |
| T | 3090 | |
| O | 2666 | |
| W | 1860 | 6.0% |
| S | 1804 | 5.9% |
| R | 1672 | 5.4% |
| U | 1286 | 4.2% |
| H | 1248 | 4.1% |
| Other values (9) | 6007 |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 89 | |
| t | 89 | |
| o | 52 | |
| h | 52 | |
| e | 45 | |
| l | 44 | |
| a | 40 | |
| n | 26 | 5.4% |
| w | 11 | 2.3% |
| s | 11 | 2.3% |
| Other values (3) | 19 | 4.0% |
Space Separator
| Value | Count | Frequency (%) |
| 1866 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 415 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31251 | |
| Common | 2281 | 6.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 4060 | |
| E | 3679 | |
| A | 3401 | |
| T | 3090 | |
| O | 2666 | |
| W | 1860 | 6.0% |
| S | 1804 | 5.8% |
| R | 1672 | 5.4% |
| U | 1286 | 4.1% |
| H | 1248 | 4.0% |
| Other values (22) | 6485 |
Common
| Value | Count | Frequency (%) |
| 1866 | ||
| / | 415 | 18.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33532 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 4060 | |
| E | 3679 | |
| A | 3401 | |
| T | 3090 | 9.2% |
| O | 2666 | 8.0% |
| 1866 | 5.6% | |
| W | 1860 | 5.5% |
| S | 1804 | 5.4% |
| R | 1672 | 5.0% |
| U | 1286 | 3.8% |
| Other values (24) | 8148 |
Have_Stream_Energy
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
| False | |
|---|---|
| True | 129 |
| Value | Count | Frequency (%) |
| False | 3185 | |
| True | 129 | 3.9% |
Have_Electricity_Energy
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
| True | |
|---|---|
| False | 1 |
| Value | Count | Frequency (%) |
| True | 3313 | |
| False | 1 | < 0.1% |
Have_NaturalGas_Energy
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 2090 | |
| False | 1224 |
PrimaryPropertyType
Categorical
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| Low-Rise Multifamily | |
|---|---|
| Mid-Rise Multifamily | |
| Small- and Mid-Sized Office | |
| Other | |
| Warehouse | |
| Other values (19) |
Length
| Max length | 27 |
|---|---|
| Median length | 22 |
| Mean length | 17.213941 |
| Min length | 5 |
Characters and Unicode
| Total characters | 57047 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hotel |
|---|---|
| 2nd row | Hotel |
| 3rd row | Hotel |
| 4th row | Hotel |
| 5th row | Hotel |
Common Values
| Value | Count | Frequency (%) |
| Low-Rise Multifamily | 966 | |
| Mid-Rise Multifamily | 561 | |
| Small- and Mid-Sized Office | 288 | 8.7% |
| Other | 250 | 7.5% |
| Warehouse | 187 | 5.6% |
| Large Office | 170 | 5.1% |
| Mixed Use Property | 132 | 4.0% |
| K-12 School | 123 | 3.7% |
| High-Rise Multifamily | 104 | 3.1% |
| Retail Store | 89 | 2.7% |
| Other values (14) | 444 |
Length
| Value | Count | Frequency (%) |
| multifamily | 1631 | |
| low-rise | 966 | |
| mid-rise | 561 | 8.1% |
| office | 500 | 7.3% |
| small | 288 | 4.2% |
| and | 288 | 4.2% |
| mid-sized | 288 | 4.2% |
| other | 250 | 3.6% |
| warehouse | 199 | 2.9% |
| large | 170 | 2.5% |
| Other values (28) | 1745 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 7501 | 13.1% |
| e | 4476 | 7.8% |
| l | 4346 | 7.6% |
| 3572 | 6.3% | |
| a | 3002 | 5.3% |
| t | 2755 | 4.8% |
| f | 2671 | 4.7% |
| M | 2651 | 4.6% |
| - | 2358 | 4.1% |
| s | 2154 | 3.8% |
| Other values (33) | 21561 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 42326 | |
| Uppercase Letter | 8506 | 14.9% |
| Space Separator | 3572 | 6.3% |
| Dash Punctuation | 2358 | 4.1% |
| Decimal Number | 246 | 0.4% |
| Other Punctuation | 39 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 7501 | |
| e | 4476 | |
| l | 4346 | |
| a | 3002 | 7.1% |
| t | 2755 | 6.5% |
| f | 2671 | 6.3% |
| s | 2154 | 5.1% |
| o | 2056 | 4.9% |
| m | 2048 | 4.8% |
| u | 1979 | 4.7% |
| Other values (14) | 9338 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2651 | |
| R | 1767 | |
| L | 1146 | |
| S | 967 | 11.4% |
| O | 750 | 8.8% |
| W | 268 | 3.2% |
| H | 213 | 2.5% |
| U | 157 | 1.8% |
| C | 143 | 1.7% |
| P | 132 | 1.6% |
| Other values (4) | 312 | 3.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 123 | |
| 2 | 123 |
Space Separator
| Value | Count | Frequency (%) |
| 3572 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2358 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 39 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50832 | |
| Common | 6215 | 10.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 7501 | |
| e | 4476 | 8.8% |
| l | 4346 | 8.5% |
| a | 3002 | 5.9% |
| t | 2755 | 5.4% |
| f | 2671 | 5.3% |
| M | 2651 | 5.2% |
| s | 2154 | 4.2% |
| o | 2056 | 4.0% |
| m | 2048 | 4.0% |
| Other values (28) | 17172 |
Common
| Value | Count | Frequency (%) |
| 3572 | ||
| - | 2358 | |
| 1 | 123 | 2.0% |
| 2 | 123 | 2.0% |
| / | 39 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57047 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 7501 | 13.1% |
| e | 4476 | 7.8% |
| l | 4346 | 7.6% |
| 3572 | 6.3% | |
| a | 3002 | 5.3% |
| t | 2755 | 4.8% |
| f | 2671 | 4.7% |
| M | 2651 | 4.6% |
| - | 2358 | 4.1% |
| s | 2154 | 3.8% |
| Other values (33) | 21561 |
NumberofBuildings
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1071213 |
| Minimum | 0 |
|---|---|
| Maximum | 111 |
| Zeros | 92 |
| Zeros (%) | 2.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 111 |
| Range | 111 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.1243398 |
|---|---|
| Coefficient of variation (CV) | 1.9187959 |
| Kurtosis | 2174.733 |
| Mean | 1.1071213 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 43.115299 |
| Sum | 3669 |
| Variance | 4.5128198 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3123 | |
| 0 | 92 | 2.8% |
| 2 | 36 | 1.1% |
| 3 | 22 | 0.7% |
| 4 | 12 | 0.4% |
| 5 | 9 | 0.3% |
| 6 | 5 | 0.2% |
| 8 | 3 | 0.1% |
| 10 | 2 | 0.1% |
| 14 | 2 | 0.1% |
| Other values (7) | 8 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 92 | 2.8% |
| 1 | 3123 | |
| 2 | 36 | 1.1% |
| 3 | 22 | 0.7% |
| 4 | 12 | 0.4% |
| 5 | 9 | 0.3% |
| 6 | 5 | 0.2% |
| 7 | 1 | < 0.1% |
| 8 | 3 | 0.1% |
| 9 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 111 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 14 | 2 | |
| 11 | 1 | < 0.1% |
| 10 | 2 | |
| 9 | 2 | |
| 8 | 3 | |
| 7 | 1 | < 0.1% |
LargestPropertyUseTypeGFA
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 3085 |
|---|---|
| Distinct (%) | 93.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79221.59 |
| Minimum | 5656 |
|---|---|
| Maximum | 9320156 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 5656 |
|---|---|
| 5-th percentile | 17521.8 |
| Q1 | 25122.25 |
| median | 39894 |
| Q3 | 76799 |
| 95-th percentile | 244684.8 |
| Maximum | 9320156 |
| Range | 9314500 |
| Interquartile range (IQR) | 51676.75 |
Descriptive statistics
| Standard deviation | 202184.9 |
|---|---|
| Coefficient of variation (CV) | 2.5521439 |
| Kurtosis | 1324.495 |
| Mean | 79221.59 |
| Median Absolute Deviation (MAD) | 17574 |
| Skewness | 30.224218 |
| Sum | 2.6254035 × 108 |
| Variance | 4.0878732 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24000 | 9 | 0.3% |
| 22000 | 8 | 0.2% |
| 30000 | 8 | 0.2% |
| 21600 | 7 | 0.2% |
| 20000 | 7 | 0.2% |
| 28800 | 6 | 0.2% |
| 15000 | 5 | 0.2% |
| 24288 | 5 | 0.2% |
| 36000 | 5 | 0.2% |
| 45000 | 5 | 0.2% |
| Other values (3075) | 3249 |
| Value | Count | Frequency (%) |
| 5656 | 1 | |
| 6455 | 1 | |
| 6601 | 1 | |
| 6900 | 1 | |
| 7245 | 1 | |
| 7387 | 1 | |
| 7501 | 1 | |
| 7583 | 1 | |
| 7758 | 1 | |
| 8061 | 1 |
| Value | Count | Frequency (%) |
| 9320156 | 1 | |
| 1719643 | 1 | |
| 1680937 | 1 | |
| 1639334 | 1 | |
| 1585960 | 1 | |
| 1350182 | 1 | |
| 1314475 | 1 | |
| 1191115 | 1 | |
| 1172127 | 1 | |
| 1011135 | 1 |
TotalGHGEmissions
Real number (ℝ)
| Distinct | 2782 |
|---|---|
| Distinct (%) | 83.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120.62974 |
| Minimum | 0.4 |
|---|---|
| Maximum | 16870.98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 0.4 |
|---|---|
| 5-th percentile | 3.9365 |
| Q1 | 9.71 |
| median | 34.28 |
| Q3 | 94.2275 |
| 95-th percentile | 393.1155 |
| Maximum | 16870.98 |
| Range | 16870.58 |
| Interquartile range (IQR) | 84.5175 |
Descriptive statistics
| Standard deviation | 542.80206 |
|---|---|
| Coefficient of variation (CV) | 4.4997366 |
| Kurtosis | 468.37182 |
| Mean | 120.62974 |
| Median Absolute Deviation (MAD) | 28.13 |
| Skewness | 19.357192 |
| Sum | 399766.96 |
| Variance | 294634.07 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.95 | 7 | 0.2% |
| 4.2 | 6 | 0.2% |
| 4.02 | 5 | 0.2% |
| 3.63 | 5 | 0.2% |
| 4.76 | 5 | 0.2% |
| 6.18 | 5 | 0.2% |
| 5.07 | 5 | 0.2% |
| 4.15 | 5 | 0.2% |
| 4.8 | 5 | 0.2% |
| 9.29 | 5 | 0.2% |
| Other values (2772) | 3261 |
| Value | Count | Frequency (%) |
| 0.4 | 1 | |
| 0.63 | 1 | |
| 0.68 | 1 | |
| 0.75 | 1 | |
| 0.79 | 1 | |
| 0.81 | 1 | |
| 0.82 | 1 | |
| 0.86 | 1 | |
| 0.87 | 1 | |
| 0.89 | 1 |
| Value | Count | Frequency (%) |
| 16870.98 | 1 | |
| 12307.16 | 1 | |
| 11140.56 | 1 | |
| 10734.57 | 1 | |
| 8145.52 | 1 | |
| 6330.91 | 1 | |
| 4906.33 | 1 | |
| 3995.45 | 1 | |
| 3768.66 | 1 | |
| 3278.11 | 1 |
SiteEnergyUse_kBtu_
Real number (ℝ)
HIGH CORRELATION  SKEWED  UNIQUE 
| Distinct | 3314 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5450556.3 |
| Minimum | 57133.199 |
|---|---|
| Maximum | 8.7392371 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 57133.199 |
|---|---|
| 5-th percentile | 520937.92 |
| Q1 | 943547.03 |
| median | 1821625 |
| Q3 | 4232943.2 |
| 95-th percentile | 18251998 |
| Maximum | 8.7392371 × 108 |
| Range | 8.7386658 × 108 |
| Interquartile range (IQR) | 3289396.2 |
Descriptive statistics
| Standard deviation | 21773595 |
|---|---|
| Coefficient of variation (CV) | 3.9947473 |
| Kurtosis | 847.3068 |
| Mean | 5450556.3 |
| Median Absolute Deviation (MAD) | 1073179.6 |
| Skewness | 24.698505 |
| Sum | 1.8063144 × 1010 |
| Variance | 4.7408943 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7226362.5 | 1 | < 0.1% |
| 6714540 | 1 | < 0.1% |
| 1148202.25 | 1 | < 0.1% |
| 876569.6875 | 1 | < 0.1% |
| 488991.5 | 1 | < 0.1% |
| 1206165.75 | 1 | < 0.1% |
| 1302192.875 | 1 | < 0.1% |
| 150167.7969 | 1 | < 0.1% |
| 1386445.375 | 1 | < 0.1% |
| 1331469.75 | 1 | < 0.1% |
| Other values (3304) | 3304 |
| Value | Count | Frequency (%) |
| 57133.19922 | 1 | |
| 79711.79688 | 1 | |
| 90558.70313 | 1 | |
| 97690.39844 | 1 | |
| 106918 | 1 | |
| 111969.7031 | 1 | |
| 113130 | 1 | |
| 116486.6016 | 1 | |
| 117438.3984 | 1 | |
| 123767.2031 | 1 |
| Value | Count | Frequency (%) |
| 873923712 | 1 | |
| 448385312 | 1 | |
| 293090784 | 1 | |
| 291614432 | 1 | |
| 274682208 | 1 | |
| 253832464 | 1 | |
| 163945984 | 1 | |
| 143423024 | 1 | |
| 131373880 | 1 | |
| 114648520 | 1 |
LargestPropertyUseTypeGFA_log
Real number (ℝ)
| Distinct | 3085 |
|---|---|
| Distinct (%) | 93.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.6776795 |
| Minimum | 3.7525094 |
|---|---|
| Maximum | 6.9694232 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 3.7525094 |
|---|---|
| 5-th percentile | 4.2435787 |
| Q1 | 4.4000585 |
| median | 4.6009076 |
| Q3 | 4.8853552 |
| 95-th percentile | 5.3886068 |
| Maximum | 6.9694232 |
| Range | 3.2169138 |
| Interquartile range (IQR) | 0.48529669 |
Descriptive statistics
| Standard deviation | 0.37089589 |
|---|---|
| Coefficient of variation (CV) | 0.079290573 |
| Kurtosis | 1.3489038 |
| Mean | 4.6776795 |
| Median Absolute Deviation (MAD) | 0.22067407 |
| Skewness | 1.0256002 |
| Sum | 15501.83 |
| Variance | 0.13756376 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.380211242 | 9 | 0.3% |
| 4.342422681 | 8 | 0.2% |
| 4.477121255 | 8 | 0.2% |
| 4.334453751 | 7 | 0.2% |
| 4.301029996 | 7 | 0.2% |
| 4.459392488 | 6 | 0.2% |
| 4.176091259 | 5 | 0.2% |
| 4.385391754 | 5 | 0.2% |
| 4.556302501 | 5 | 0.2% |
| 4.653212514 | 5 | 0.2% |
| Other values (3075) | 3249 |
| Value | Count | Frequency (%) |
| 3.752509401 | 1 | |
| 3.809896247 | 1 | |
| 3.819609733 | 1 | |
| 3.838849091 | 1 | |
| 3.86003839 | 1 | |
| 3.868468099 | 1 | |
| 3.875119165 | 1 | |
| 3.879841056 | 1 | |
| 3.889749775 | 1 | |
| 3.906388921 | 1 |
| Value | Count | Frequency (%) |
| 6.969423182 | 1 | |
| 6.235438296 | 1 | |
| 6.225551437 | 1 | |
| 6.214667446 | 1 | |
| 6.20029223 | 1 | |
| 6.130392314 | 1 | |
| 6.118752331 | 1 | |
| 6.075953694 | 1 | |
| 6.06897467 | 1 | |
| 6.004809144 | 1 |
TotalGHGEmissions_log
Real number (ℝ)
| Distinct | 2782 |
|---|---|
| Distinct (%) | 83.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5286686 |
| Minimum | -0.39794001 |
|---|---|
| Maximum | 4.2271403 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 10 |
| Negative (%) | 0.3% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | -0.39794001 |
|---|---|
| 5-th percentile | 0.59510994 |
| Q1 | 0.98721923 |
| median | 1.5350407 |
| Q3 | 1.9741777 |
| 95-th percentile | 2.5945202 |
| Maximum | 4.2271403 |
| Range | 4.6250803 |
| Interquartile range (IQR) | 0.98695844 |
Descriptive statistics
| Standard deviation | 0.64537739 |
|---|---|
| Coefficient of variation (CV) | 0.42218267 |
| Kurtosis | -0.17943767 |
| Mean | 1.5286686 |
| Median Absolute Deviation (MAD) | 0.4830969 |
| Skewness | 0.27622109 |
| Sum | 5066.0078 |
| Variance | 0.41651198 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.5965970956 | 7 | 0.2% |
| 0.6232492904 | 6 | 0.2% |
| 0.6042260531 | 5 | 0.2% |
| 0.559906625 | 5 | 0.2% |
| 0.6776069527 | 5 | 0.2% |
| 0.7909884751 | 5 | 0.2% |
| 0.7050079593 | 5 | 0.2% |
| 0.6180480967 | 5 | 0.2% |
| 0.6812412374 | 5 | 0.2% |
| 0.968015714 | 5 | 0.2% |
| Other values (2772) | 3261 |
| Value | Count | Frequency (%) |
| -0.3979400087 | 1 | |
| -0.2006594505 | 1 | |
| -0.1674910873 | 1 | |
| -0.1249387366 | 1 | |
| -0.1023729087 | 1 | |
| -0.09151498112 | 1 | |
| -0.08618614762 | 1 | |
| -0.06550154876 | 1 | |
| -0.06048074738 | 1 | |
| -0.05060999336 | 1 |
| Value | Count | Frequency (%) |
| 4.227140311 | 1 | |
| 4.090157847 | 1 | |
| 4.046907022 | 1 | |
| 4.030784652 | 1 | |
| 3.910918814 | 1 | |
| 3.80146614 | 1 | |
| 3.690756756 | 1 | |
| 3.6015657 | 1 | |
| 3.576186958 | 1 | |
| 3.515623523 | 1 |
SiteEnergyUse_kBtu_log
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 3314 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.3401237 |
| Minimum | 4.7568885 |
|---|---|
| Maximum | 8.9414735 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 4.7568885 |
|---|---|
| 5-th percentile | 5.716786 |
| Q1 | 5.9747635 |
| median | 6.2604589 |
| Q3 | 6.6266424 |
| 95-th percentile | 7.2613093 |
| Maximum | 8.9414735 |
| Range | 4.184585 |
| Interquartile range (IQR) | 0.65187897 |
Descriptive statistics
| Standard deviation | 0.49374055 |
|---|---|
| Coefficient of variation (CV) | 0.07787554 |
| Kurtosis | 0.95099165 |
| Mean | 6.3401237 |
| Median Absolute Deviation (MAD) | 0.31456573 |
| Skewness | 0.79560926 |
| Sum | 21011.17 |
| Variance | 0.24377973 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.858919744 | 1 | < 0.1% |
| 6.827016265 | 1 | < 0.1% |
| 6.060018394 | 1 | < 0.1% |
| 5.942786448 | 1 | < 0.1% |
| 5.68930131 | 1 | < 0.1% |
| 6.081406992 | 1 | < 0.1% |
| 6.114675315 | 1 | < 0.1% |
| 5.176576809 | 1 | < 0.1% |
| 6.141902763 | 1 | < 0.1% |
| 6.124331304 | 1 | < 0.1% |
| Other values (3304) | 3304 |
| Value | Count | Frequency (%) |
| 4.756888543 | 1 | |
| 4.901522599 | 1 | |
| 4.956930194 | 1 | |
| 4.989851881 | 1 | |
| 5.029050826 | 1 | |
| 5.049100527 | 1 | |
| 5.053577787 | 1 | |
| 5.066275975 | 1 | |
| 5.06981012 | 1 | |
| 5.092605577 | 1 |
| Value | Count | Frequency (%) |
| 8.941473523 | 1 | |
| 8.651651378 | 1 | |
| 8.467002163 | 1 | |
| 8.464809013 | 1 | |
| 8.43883053 | 1 | |
| 8.404547166 | 1 | |
| 8.214700783 | 1 | |
| 8.156618875 | 1 | |
| 8.118509027 | 1 | |
| 8.059368453 | 1 |
| YearBuilt | NumberofBuildings | LargestPropertyUseTypeGFA | TotalGHGEmissions | SiteEnergyUse_kBtu_ | LargestPropertyUseTypeGFA_log | TotalGHGEmissions_log | SiteEnergyUse_kBtu_log | BuildingType | Neighborhood | Have_Stream_Energy | Have_Electricity_Energy | Have_NaturalGas_Energy | PrimaryPropertyType | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| YearBuilt | 1.000 | 0.038 | 0.291 | 0.027 | 0.161 | 0.291 | 0.027 | 0.161 | 0.158 | 0.176 | 0.157 | 0.026 | 0.338 | 0.186 |
| NumberofBuildings | 0.038 | 1.000 | 0.083 | 0.049 | 0.055 | 0.083 | 0.049 | 0.055 | 0.240 | 0.000 | 0.081 | 0.000 | 0.000 | 0.136 |
| LargestPropertyUseTypeGFA | 0.291 | 0.083 | 1.000 | 0.571 | 0.736 | 1.000 | 0.571 | 0.736 | 0.148 | 0.015 | 0.147 | 0.000 | 0.024 | 0.230 |
| TotalGHGEmissions | 0.027 | 0.049 | 0.571 | 1.000 | 0.878 | 0.571 | 1.000 | 0.878 | 0.126 | 0.000 | 0.198 | 0.000 | 0.034 | 0.259 |
| SiteEnergyUse_kBtu_ | 0.161 | 0.055 | 0.736 | 0.878 | 1.000 | 0.736 | 0.878 | 1.000 | 0.155 | 0.000 | 0.127 | 0.000 | 0.023 | 0.276 |
| LargestPropertyUseTypeGFA_log | 0.291 | 0.083 | 1.000 | 0.571 | 0.736 | 1.000 | 0.571 | 0.736 | 0.196 | 0.086 | 0.231 | 0.000 | 0.167 | 0.285 |
| TotalGHGEmissions_log | 0.027 | 0.049 | 0.571 | 1.000 | 0.878 | 0.571 | 1.000 | 0.878 | 0.216 | 0.095 | 0.353 | 0.031 | 0.691 | 0.315 |
| SiteEnergyUse_kBtu_log | 0.161 | 0.055 | 0.736 | 0.878 | 1.000 | 0.736 | 0.878 | 1.000 | 0.239 | 0.115 | 0.288 | 0.000 | 0.352 | 0.353 |
| BuildingType | 0.158 | 0.240 | 0.148 | 0.126 | 0.155 | 0.196 | 0.216 | 0.239 | 1.000 | 0.203 | 0.195 | 0.000 | 0.289 | 0.729 |
| Neighborhood | 0.176 | 0.000 | 0.015 | 0.000 | 0.000 | 0.086 | 0.095 | 0.115 | 0.203 | 1.000 | 0.285 | 0.000 | 0.154 | 0.200 |
| Have_Stream_Energy | 0.157 | 0.081 | 0.147 | 0.198 | 0.127 | 0.231 | 0.353 | 0.288 | 0.195 | 0.285 | 1.000 | 0.000 | 0.014 | 0.295 |
| Have_Electricity_Energy | 0.026 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.031 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.276 |
| Have_NaturalGas_Energy | 0.338 | 0.000 | 0.024 | 0.034 | 0.023 | 0.167 | 0.691 | 0.352 | 0.289 | 0.154 | 0.014 | 0.000 | 1.000 | 0.355 |
| PrimaryPropertyType | 0.186 | 0.136 | 0.230 | 0.259 | 0.276 | 0.285 | 0.315 | 0.353 | 0.729 | 0.200 | 0.295 | 0.276 | 0.355 | 1.000 |
| YearBuilt | BuildingType | Neighborhood | Have_Stream_Energy | Have_Electricity_Energy | Have_NaturalGas_Energy | PrimaryPropertyType | NumberofBuildings | LargestPropertyUseTypeGFA | TotalGHGEmissions | SiteEnergyUse_kBtu_ | LargestPropertyUseTypeGFA_log | TotalGHGEmissions_log | SiteEnergyUse_kBtu_log | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1927 | NonResidential | DOWNTOWN | True | True | True | Hotel | 1 | 88434.0 | 249.98 | 7226362.5 | 4.946619 | 2.397905 | 6.858920 |
| 1 | 1996 | NonResidential | DOWNTOWN | False | True | True | Hotel | 1 | 83880.0 | 295.86 | 8387933.0 | 4.923658 | 2.471086 | 6.923655 |
| 2 | 1969 | NonResidential | DOWNTOWN | True | True | True | Hotel | 1 | 756493.0 | 2089.28 | 72587024.0 | 5.878805 | 3.319997 | 7.860859 |
| 3 | 1926 | NonResidential | DOWNTOWN | True | True | True | Hotel | 1 | 61320.0 | 286.43 | 6794584.0 | 4.787602 | 2.457019 | 6.832163 |
| 4 | 1980 | NonResidential | DOWNTOWN | False | True | True | Hotel | 1 | 123445.0 | 505.01 | 14172606.0 | 5.091474 | 2.703300 | 7.151450 |
| 5 | 1999 | Nonresidential COS | DOWNTOWN | False | True | True | Other | 1 | 88830.0 | 301.81 | 12086616.0 | 4.948560 | 2.479734 | 7.082305 |
| 6 | 1926 | NonResidential | DOWNTOWN | False | True | True | Hotel | 1 | 81352.0 | 176.14 | 5758795.0 | 4.910368 | 2.245858 | 6.760332 |
| 7 | 1926 | NonResidential | DOWNTOWN | True | True | True | Other | 1 | 102761.0 | 221.51 | 6298131.5 | 5.011828 | 2.345393 | 6.799212 |
| 8 | 1904 | NonResidential | DOWNTOWN | False | True | True | Hotel | 1 | 163984.0 | 392.16 | 13723820.0 | 5.214801 | 2.593463 | 7.137475 |
| 9 | 1910 | Multifamily MR (5-9) | DOWNTOWN | True | True | True | Mid-Rise Multifamily | 1 | 56132.0 | 151.12 | 4573777.0 | 4.749211 | 2.179322 | 6.660275 |
| YearBuilt | BuildingType | Neighborhood | Have_Stream_Energy | Have_Electricity_Energy | Have_NaturalGas_Energy | PrimaryPropertyType | NumberofBuildings | LargestPropertyUseTypeGFA | TotalGHGEmissions | SiteEnergyUse_kBtu_ | LargestPropertyUseTypeGFA_log | TotalGHGEmissions_log | SiteEnergyUse_kBtu_log | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3304 | 1952 | Nonresidential COS | MAGNOLIA / QUEEN ANNE | False | True | False | Office | 1 | 13661.0 | 3.50 | 5.026677e+05 | 4.135482 | 0.544068 | 5.701281 |
| 3305 | 1912 | Nonresidential COS | EAST | False | True | True | Other | 1 | 23445.0 | 259.22 | 5.976246e+06 | 4.370050 | 2.413669 | 6.776428 |
| 3306 | 1994 | Nonresidential COS | CENTRAL | False | True | True | Mixed Use Property | 1 | 8108.0 | 60.81 | 1.813404e+06 | 3.908914 | 1.783975 | 6.258495 |
| 3307 | 1960 | Nonresidential COS | SOUTHEAST | False | True | True | Office | 1 | 15398.0 | 7.79 | 3.878100e+05 | 4.187464 | 0.891537 | 5.588619 |
| 3308 | 1982 | Nonresidential COS | DELRIDGE NEIGHBORHOODS | False | True | True | Other | 1 | 18261.0 | 20.33 | 9.320821e+05 | 4.261525 | 1.308137 | 5.969454 |
| 3309 | 1990 | Nonresidential COS | GREATER DUWAMISH | False | True | True | Office | 1 | 12294.0 | 20.94 | 8.497457e+05 | 4.089693 | 1.320977 | 5.929289 |
| 3310 | 2004 | Nonresidential COS | DOWNTOWN | False | True | True | Other | 1 | 16000.0 | 32.17 | 9.502762e+05 | 4.204120 | 1.507451 | 5.977850 |
| 3311 | 1974 | Nonresidential COS | MAGNOLIA / QUEEN ANNE | False | True | True | Other | 1 | 7583.0 | 223.54 | 5.765898e+06 | 3.879841 | 2.349355 | 6.760867 |
| 3312 | 1989 | Nonresidential COS | GREATER DUWAMISH | False | True | True | Mixed Use Property | 1 | 6601.0 | 22.11 | 7.194712e+05 | 3.819610 | 1.344589 | 5.857013 |
| 3313 | 1938 | Nonresidential COS | GREATER DUWAMISH | False | True | True | Mixed Use Property | 1 | 8271.0 | 41.27 | 1.152896e+06 | 3.917558 | 1.615634 | 6.061790 |